TITLE OF THE INVENTION 
VIDEO DATA REPRODUCTION APPARATUS, SCHEDULE DATA, VIDEO 
DATA REPRODUCTION METHOD, AND VIDEO DATA REPRODUCTION 
PROGRAM 

CROSS-REFERENCE TO RELATED APPLICATIONS 
This application is based upon and claims the 
benefit of priority from the prior Japanese Patent 
Applications No. 2002-183336, filed June 24, 2002; and 
No. 2003-170739, filed June 16, 2003, the entire 
contents of both of which are incorporated herein by 
reference . 

BACKGROUND OF THE INVENTION 

1. Field of the Invention 

The present invention relates to a video data 
reproduction apparatus having a navigation function for 
video data contents, and schedule data, a video data 
reproduction method and a video data reproduction 
program that realize the navigation function. 

2. Description of the Related Art 

In accordance with recent widespread use of PCs, 
the Internet, DVDs, etc., various navigation functions 
have been proposed, which enable users to acquire, 
display, etc. information related to contents, such as 
movies, as well as to enjoy the contents. 

In particular, there is a demand for a function 
for enabling users to, for example, click a certain 
object in contents, using a mouse, thereby executing 
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a predetermined action related to the object. 
For example, when a user clicks a person or object in 
a moving picture, such as a movie, using a mouse, 
a narrative text concerning the clicked person or 
5 object is displayed. 

As a basic technique for realizing the above 
function, a method has been proposed for effectively 
defining the area for a particular object in a moving 
picture, using arbitrary space-time area information 
10 of an amount corresponding to a plurality of frames. 

However, almost nothing has yet to materialize as 
a specific system realizing the above feature. 

BRIEF SUMMARY OF THE INVENTION 
The present invention has been developed in 
15 light of the above, and aims to provide a video data 

reproduction apparatus having an effective navigation 
function for video data contents, and schedule data, 
a video data reproduction method and a video data 
reproduction program that realize the navigation 
20 function. 

The present invention also aims to provide a video 
data reproduction apparatus, method and program capable 
of performing a jump from a certain page to a target 
object to activate the object. 
25 The invention further aims to provide schedule 

data for effectively displaying an object. 

According to a first aspect of the invention, 
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there is provided a video data reproduction apparatus 
comprising: a selecting unit configured to select 
an object to be reproduced in video data contents and 
configured to obtain a corresponding object related 
5 data, the object related data being related to the 

object and including a start time at which the object 
appears in the video data contents; a first acquisition 
unit configured to acquire the start time from the 
object related data; and a reproduction unit configured 

10 to start reproducing the video data contents based on 

the start time. 

According to a second aspect of the invention, 
there is provided a schedule data associated with 
an object which appears in video data contents, the 

15 object being reproduced by a reproduction apparatus, 

the schedule data comprising: a start time at which 
the object appears in the video data contents. 

According to a third aspect of , the invention, 
there is provided a video data reproduction method 

20 comprising: selecting an object to be reproduced in 

video data contents; obtaining a corresponding object 
related data, the object related data being related to 
the object and including a start time at which the 
object appears in the video data contents; acquiring 

25 the start time from the object related data; and 

starting reproducing the video data contents based on 
the start time. 



According to a fourth aspect of the invention, 
there is provided a video data reproduction program 
stored in a medium which can be read by a computer, 
comprising: means for instructing the computer to 
select an object to be reproduced in video data 
contents and obtain a corresponding- object related 
data, the object related data being related to the 
object and including a start time at which the object 
appears in the video data contents; first acguisition 
means for instructing the computer to acquire the start 
time from the object related data; and means for 
instructing the computer to start reproducing the video 
data contents based on the start time. 

According to a fifth aspect of the invention, 
there is provided a video data reproduction apparatus 
which reproduces an object, the object being appearing 
in video data contents, the video data reproduction 
apparatus comprising: a first acquisition unit 
configured to acquire schedule data which includes 
a start time at which the object appears in the video 
data contents; a second acquisition unit configured to 
acquire the start time from the schedule data; and 
a reproduction unit configured to start reproducing 
the video data contents based on the start time. 
BRIEF DESCRIPTION OF THE SEVERAL VIEWS OF THE DRAWING 

FIG. 1 is a block diagram illustrating a video 
reproduction apparatus according to a first embodiment 
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of the invention; 

FIG. 2 is a view illustrating a system configura- 
tion example of the video reproduction apparatus of 
FIG. 1; 

5 FIG. 3 is a view illustrating the structure of 

schedule data; 

FIG. 4 is a view illustrating a schedule of the 
period of the activated state of each object based on 
certain schedule data; 
10 FIG. 5 is a view illustrating the structure of 

mask data; 

FIG. 6 is a view illustrating the structure of 
object data; 

FIG. 7A is a view illustrating an object moving in 
15 frames with lapse of time; 

FIG. 7B is a view illustrating the frames 
appearing in FIG . 7A; 

FIG. 7C is a view illustrating the object 
appearing in FIG. 7A; 
20 FIG. 8 is a view useful in explaining a reverse 

link operation; 

FIG. 9 is a flowchart useful in explaining the 
operation of the video reproduction apparatus performed 
during the reverse link operation; 
25 FIG. 10 is a flowchart useful in explaining the 

operation of the V-click engine, appearing in FIG. 1, 
performed during the reverse link operation; 
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FIG. 11 is a view illustrating a video display 
window subjected to mask processing; 

FIG . 12 is a view illustrating a state in which 
a visual object and a cursor overlap each other; 
5 FIG. 13 is a view illustrating a state in which 

a balloon protrudes from the visual object; 

FIG. 14 is a flowchart useful in explaining the 
operation of the video reproduction apparatus of FIG. 1 
during the reproduction operation of the apparatus; 
10 FIG. 15 is a flowchart useful in explaining the 

operation of the V-click engine appearing in FIG. 1; 

FIG. 16 is a flowchart useful in explaining the 
operation of the V-click decoder appearing in FIG. 1; 

FIG. 17 is a view illustrating a state in which 
15 the video reproduction apparatus is connected to 

a server via the Internet; 

FIG. 18 is a block diagram illustrating a video 
reproduction apparatus according to a second embodiment 
of the invention; 
20 FIG. 19 is a table illustrating the relationship 

between the contents of V-click data items and contents 
IDs; 

FIG. 20 is a table illustrating the relationship 
between URLs indicating the locations of V-click data 
25 items and contents IDs; 

FIG. 21 is a flowchart useful in explaining 
downloading of V-click data; and 
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FIG. 22 is a flowchart useful in explaining 
certification of V-click data. 

DETAILED DESCRIPTION OF THE INVENTION 
Embodiments of the invention will be described 
5 with reference to the accompanying drawings. 

FIG. 1 is a block diagram illustrating a video 
reproduction apparatus according to a first embodiment 
of the invention. 

A video reproduction apparatus 1 of the first 
10 embodiment contains a DVD- video playback engine 

(hereinafter referred to simply as a "playback engine") 
2 for reproducing a DVD, and extension navigation 
engine (hereinafter referred to simply as an "ENAV 
engine") 3. 

15 The video reproduction apparatus 1 may have one or 

more functions similar to those employed in 
conventional DVD players. 

A DVD 100 stores ENAV contents 101 and DVD 
contents 102. The DVD contents 102 include, e.g., 

20 audio/video contents (such as movie contents), or. just 

video contents. ENAV contents 101 contain information 
related to particular DVD contents 102 that correspond 
to the ENAV contents. The ENAV contents are, for 
example, audio data, still image data, text data, 

25 animation data, and V-click data that provides an 

environment for a V-click described later. In the 
embodiment, three types of V-click data items, i.e., 
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schedule data, mask data and object data, are provided 
for one DVD contents 102. These three types of V-click 
data items are described in detail later. 

The ENAV engine 3 detects a user event signal that 
5 is generated as a result of a user operation made in 

association with DVD contents 102, thereby reproducing 
the DVD contents 102, displaying information related to 
the DVD contents 102, or displaying information related 
to a particular object during reproduction of the DVD 

10 contents 102. 

The ENAV engine 3 comprises an AV renderer 31, 
element decoder 32, user event controller 33, extension 
navigation interface handler (hereinafter referred to 
simply as a "handler"), parser/interpreter 35, and 

15 V-click engine 36. The element decoder 32 includes 

a V-click decoder 321. 

The playback engine 2 reproduces a desired part of 
DVD contents 102 recorded on a DVD. In other words, 
the playback engine 2 reads DVD contents 102 recorded 

20 on a DVD, decodes (or decrypts and decodes) it, and 

outputs the decoded data to the AV renderer 31. 

The playback engine 2 detects, via the user event 
controller 33, a user event signal that occurs as 
a result of a user operation. The user event signal 

25 contains, for example, a reproduction instruction, 

fast- forward instruction, random-access instruction, 
instruction to change the quality of a to-be-reproduced 
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video image, and instruction to jump the position of 
reproduction. When the playback engine 2 receives a 
user event signal instructing reproduction, it supplies 
the handler 34 with the name of a to-be-reproduced file 
5 in DVD contents 102 (this name may hereinafter be 

referred to as a "reproduction filename"). 

The playback engine 2 outputs, to the handler 
34 periodically or when necessary, the time stamp of 
image data that is being reproduced. The time stamp 

10 indicates the current time of reproduction and 

corresponds to the reproduction position of the DVD 
contents 102 (image data) that is being reproduced. 
On the other hand, the playback engine 2 receives, via 
the handler 34, a command for controlling the reproduc- 

15 tion of DVD contents 102. Specifically, the playback 

engine 2 receives, from the handler 34, the name of 
the to-be-reproduced file in the DVD contents 102 and 
a reproduction start time, thereby starting the 
reproduction at the position that the reproduction 

20 start time indicates. 

The AV renderer 31 receives decoded data from 
the playback engine 2. Further, the AV renderer 31 
receives, from the element decoder 32, element data 
including video data and audio data generated based on 

25 ENAV contents 101. The element data is, for example, 

audio data, still image data, text data or animation 
data. The element data also includes highlight element 



data, mosaic element data, etc. However, the type of 
image processing is not limited to the above, and 
a number of image processes exist. 

The AV renderer 31 overlays the data received from 
the playback engine 2 on the element data received from 
the element decoder 32. After that, the AV renderer 31 
converts the overlaid data into an analog video signal 
and/or an analog audio signal that can be output to 
a monitor apparatus 8 having a display window 81 and 
speaker 84, etc., described later. Of course, there 
may be a case where no element data is output from the 
element decoder 32 and therefore is not overlaid on the 
data from the playback engine 2. In this case, video 
data is reproduced on the basis of only the data only 
received from the playback engine 2 . 

From the input ENAV contents 101, the element 
decoder 32 generates element data, and outputs this 
data to the AV renderer 31. 

The V-click decoder 321 processes V-click engine 
data received from the V-click engine 36. This V-click 
engine data indicates, for example, a non-mask area, 
balloon contents and a balloon display position, etc. 
at the present time. 

Upon receiving the balloon contents and the 
balloon display position from the V-click engine 36, 
the V-click decoder 321 stores it in an internal memory 
of the V-click decoder 321. If the balloon contents 



and the balloon display position are already stored in 
the internal memory, they are updated with new data* 
When mask display is performed, it is very possible 
that the data in the non-mask area is frequently 
updated. The V-click decoder 321 checks the internal 
memory, and generates a balloon on the basis of the 
balloon contents and the balloon display position if 
the balloon contents is stored in the internal memory. 
The generated balloon is recorded to the internal 
memory. 

On the basis of the present time, the V-click 
decoder 321 generates a mask based on the pattern 
and color designated by mask data, described later, 
contained in the ENAV contents 101. At this time, 
the V-click decoder 321 receives the reproduction 
filename from the parser/interpreter 35,' thereby 
executing mask-processing based on mask data 
corresponding to the reproduction filename. 

The user event controller 33 processes the 
instruction (s) . contained in the above-described 
instructions contained in the user event signal that 
occurs as a result of a user operation. The user event 
controller 33 receives a user event signal and outputs 
a predetermined module. The type of module depends 
upon the type of instruction contained in a user event 
signal . 

For example, a user event signal, which contains 



12 



an instruction to perform reproduction and an 
instruction to change the image quality of reproduced 
video data, are only output to the playback engine 2. 
On the other hand, a user event signal containing an 
5 instruction to jump the position of reproduction, is 

output to both the playback engine 2 and handler 34. 
Further, instructions contained in click events signals 
caused by a V-click, a click related to a reverse link 
operation, other types of click, etc., are supplied 

10 from the user event controller 33 to the handler 34. 

Furthermore, the user event controller .33 receives user 
event signals for instructing ON/OFF of mask display, 
ON/OFF of balloon display, etc., and outputs 
predetermined modules. 

15 The handler 34 receives, from the user event 

controller 33, an instruction to jump the position 
of reproduction, fast-forward instruction, random- 
access instruction, etc. The handler 34 receives, 
periodically or when necessary, the time stamp of image 

20 data that is being reproduced. The handler 34 also 

receives the reproduction filename from the playback 
engine 2. The handler 34 outputs the reproduction 
filename to the parser/interpreter 35 and V-click 
engine 36. The handler 34 further receives, from the 

25 parser/interpreter 35, a DVD reproduction instructing 

signal containing information indicative of the 
reproduction filename and reproduction start time. 
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On the basis of the input data, the handler 34 supplies 
the playback engine 2 with a command for controlling 
the reproduction of DVD contents 102. The handler 34 
also supplies the user event controller 33 with a 
5 control signal for the controller 33, and supplies the 

AV renderer 31 with a control signal for controlling 
video data and audio data. 

The parser/interpreter 35 reads, when necessary, 
a script from ENAV contents 101, interprets the script 

10 and executes the script on the basis of the interpreta- 

tion. This script is used to, for example, control 
the reproduction of DVD contents 102, display and 
reproduction of information relating to the DVD 
contents 102, and acquisition, display and reproduction 

15 of predetermined contents from the Internet. 

Further, the parser/interpreter 35 receives 
a script from the V-click engine 36, interprets the 
script and executes the script on the basis of the 
interpretation. This script is output from the V-click 

20 engine 36 when a user click event causes a certain 

object to be hit. 

When the user event controller 33 has input a user 
event signal, the parser/interpreter 35 receives the 
instruction (s) contained in this user event signal via 

25 the handler 34. For example, if the instruction is 

the issuance of a mask display ON/OFF instruction, the 
parser/interpreter 35 supplies the V-click engine 36 
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with a signal for setting ON/OFF of mask display. 

Further, if the instruction contained in a user 
event signal is an instruction to click an object, the 
parser/interpreter 35 supplies the V-click engine 36 
5 with a signal indicative of the position of this click 

generation. After that, the parser/interpreter 35 
receives a script corresponding to the instruction 
contained in the signal from the V-click engine 36, 
and executes the script. In another example, the 

10 parser/interpreter 35 acquires, from tag data contained 

in a user event signal, the filename, reproduction 
start time, and the label of an object, thereby 
supplying the handler 34 with an instruction to 
reproduce the to-be-reproduced file in the DVD contents 

15 102 and reproduction start time. 

The V-click engine 36 reads data from the "MPEG7 
field" contained in an object file, and waits for 
a user click event (V-click) signal. The click event 
signal designates a point of space-time (i.e., a space- 

20 time point) . In other words, a click event signal 

designates space coordinates and a time coordinate. 
The V-click engine 36 determines whether or not the 
point given by a click event signal is included in the 
three-dimensional area (i.e., two space dimensions + 

25 one time dimension) occupied by an object. If it is 

included, the V-click engine 36 determines that a click 
corresponding to the click event signal has hit an 
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object, and the parser/interpreter 35 executes a script 
contained in object data of the object, or jumping to 
a URL contained in the object data. 

There is a case where data in the "MPEG7 field" 
5 describes a curve in space-time. For example, data 

concerning the position of a balloon describes such a 
curve. In this case, the V-click engine 36 determines 
whether or not the point clicked by a user is included 
in the balloon. If it is included, the V-click engine 

10 36 determines that this click event has caused an 

object to be hit, thereby executing a script contained 
in object data of the object. 

The V-click engine 36 sets ON/ OFF of mask display 
under the control of the parser/interpreter 35. 

15 Further, the V-click engine 36 stores the state of 

mask display ON/ OFF as a flag in an internal memory. 
Similarly, the V-click engine 36 sets ON/OFF of balloon 
display, utilizing the parser/interpreter 35, and 
stores the state of balloon display ON/OFF as a flag in 

2 0 the internal memory. 

Upon receiving, from the parser/interpreter 35, 
the reproduction filename and an instruction to read 
corresponding contents, the V-click engine 36 reads 
a schedule file corresponding to the reproduction 

25 filename from ENAV contents 101. If necessary, the 

V-click engine 36 reads, from the ENAV contents 101, 
an object file corresponding to the schedule file. 
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Further, the V-click engine 36 receives, from the 
handler 34, the time stamp of a reproduction image 
contained in the DVD contents 102. If necessary, the 
V-click engine 36 computes the configuration of an 
5 object on the basis of the time stamp and object data, 

thereby detecting, for example, the non-mask area and 
the balloon display position, etc., and extracting 
corresponding balloon contents from the object data. 

Upon reading an object file, the V-click engine 36 

10 causes the object to be ready. For example, when a 

click event (V-click) signal occurs, the V-click engine 
36 registers this object as one of the objects for 
determining whether or not the time and the click 
position will be just hit. There is a case where 

15 labels assigned to a plurality of objects are pre- 

registered as non-mask objects in the V-click engine 
36. For example, ENAV contents 101 contain the labels 
of to-be-registered non-mask objects, and registration 
is performed on the basis of these labels. If the mask 

20 display is ON, the V-click engine 36 computes the 

configuration of a non-mask object at the present time, 
and outputs the computed configuration as a non-mask 
area to the V-click decoder 321. If a plurality of 
non-mask objects exist, the logical sum of the non-mask 

25 areas is output as a non-mask area to the V-click 

decoder 321. 

There is also a case where labels assigned to 
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a plurality of objects are pre-registered as balloon 
display objects in the V-click engine 36. For example, 
ENAV contents 101 contain the labels of to-be- 
registered balloon display objects, and registration is 
5 performed on the basis of these labels. If the balloon 

display is ON, the V-click engine 36 computes the 
position of a balloon display object at the present 
time and the balloon display position, and outputs 
the contents and the balloon display position of 

10 the balloon display object to the V-click decoder 321. 

The balloon display position is predetermined. For 
example, this position is above and to the right of 
an object. The V-click engine 36 may be constructed so 
that it can change the balloon display position, such 

15 as to be left or right of an object, or above left or 

right of the object, by appropriately changing the 
variable of the engine 36. Upon receiving a click 
event signal, the V-click engine 36 determines whether 
or not the objects in the ready state include an object 

20 . that contains the space-time point designated by the 

click position and the present time. The V-click 
engine 36 compares the reproduction end time of each 
object with the present time, thereby releasing, from 
the ready state, each object whose reproduction end 

25 time has been exceeded, and deleting it from its 

internal memory. The script of each hit object is send 
to the parser/interpreter 35. 



The V-click engine 36 outputs any non-mask area 
and balloon contents and a position to the V-click 
decoder 321. The V-click decoder 321 processes the 
V-click engine data supplied from the V-click engine 
36. Specifically, the decoder 321 synthesizes the 
images corresponding to V-click engine data, and 
generates masks. 

A description will now be given of the operation 
of the V-click engine 3 6 performed during a reverse 
link operation. A link operation indicates, for 
example, displaying, on a browser screen, information 
that a predetermined URL indicates when an object in 
a moving picture is clicked. On the other hand, a 
reverse link operation indicates an operation opposite 
to the link operation. For example, the reverse link 
operation indicates displaying, on the moving picture, 
the video data contents corresponding to the object 
that a object name indicates, when a button selected 
from a list of object names displayed on the browser 
screen is clicked. 

In the reverse link operation, firstly, the 
V-click engine 36 clears all objects that are currently 
in the ready state. Subsequently, the engine 36 reads 
schedule data corresponding to the reproduction 
filename. The schedule data will be described later 
with reference to FIGS. 3 and 4. After that, the 
V-click engine 36 acquires present time data from the 
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handler 34, reads, in reference with the present time 
data according to the schedule data, the object data of 
all necessary objects described in the read schedule 
data, and makes the objects in a ready state. Since 
5 the V-click engine 36 thus reads schedule data from 

ENAV contents 101, it can detect all necessary objects 
and the object data of each object. 

Reverse link operation can be executed without 
schedule data. In this case, the reproduction filename 

10 of DVD contents 102 is input, and all object data 

needed for the input reproduction file is made in a 
ready state. Thus, schedule data is not always needed 
for reverse link operation. 

Further, the V-click engine 36 turns on, for 

15 example, mask display. In this case, the engine 36 

clears all currently registered non-mask objects, and 
registers, as a non-mask object, the label of the 
object received from the parser/interpreter 35. The 
V-click engine 36 generates a non-mask area for the 

20 non-mask object, and outputs the non-mask area to the 

V-click decoder 321. If balloon display is in the ON 
state, the V-click engine 36 obtains the position of 
the object to be displayed as a balloon at the present 
time, computes the display position of the balloon, and 

25 outputs the balloon contents and the balloon display 

position of this object to the V-click decoder 321. 
Upon receiving a click event signal, the V-click engine 



20 - 



36 determines whether or not the currently valid 
objects include an object that contains the space- time 
point determined from the click position and the 
present time* 

5 There can be DVD contents 102 provided with ENAV 

contents 101, and those without ENAV contents 101. 
The V-click engine 36 reads schedule data and object 
data etc. described later, from ENAV contents 101. 
ENAV contents 101 contain the label of an object to be 
10 registered as a balloon display target. On the basis 

of this label, the V-click engine 36 registers the 
object as a balloon display target. 

For example, when a script of ENAV contents 101 is 
executed, the object in a moving picture is displayed, 
15 highlighted. 

The embodiment describes that the video 
reproduction apparatus is provided with a structure for 
reproducing audio data. However, this apparatus can 
be constructed so that no audio data is reproduceable . 
20 . Further, although no description is given of the 

. reproduction of audio data in DVD contents, assume that 
audio data is reproduced in synchronism with video data 
in the DVD contents. 

In the embodiment, each frame of video data in DVD 
25 contents 102 is specified by a timestamp attached to 

each frame. It is a matter of course that information 
other than the timestamp may be used to specify each 
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frame of video data. 

FIG. 2 shows a system configuration example of the 
video reproduction apparatus 1 of FIG. 1. 

The video reproduction apparatus 1 is connected to 
5 the monitor apparatus 8 such as a TV set. As seen from 

FIG. 2, the video reproduction apparatus 1 provides 
a multi-window environment. In the example of FIG. 2, 
a video display window 81 is provided for displaying 
contents of a DVD 100, such as a movie. An information 

10 display window 82 is provided for displaying an 

information screen such as a browser screen. A DVD 
operation window 83 is provided for operating the DVD 
100, utilizing soft keys, such as reproducing, stopping 
or fast-forwarding the DVD. It is preferable that a 

15 user can set the size, position, display/non-display, 

etc., of the video display window 81, the information 
display window 82 and the DVD operation window 83. 

The video reproduction apparatus 1 of the 
embodiment is operable using a remote controller 4. 

20 Further, in the example of FIG. 2, a speaker 84 

outputs, for example, the sound attached to video 
data displayed on the video display window 81. 

In the embodiment, the remote controller 4 and the 
video reproduction apparatus 1 enable a user to operate 

25 a cursor 812 displayed on the screen of the monitor 

apparatus 8, thereby realizing a graphical user 
interface. In the example of FIG. 2, an object 811, 
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link 821 and buttons 831 - 836 operate when a user 
clicks the cursor 812. The cursor 812 can be moved 
by a movement button on the remote controller 4. For 
example, when the object 811 is clicked, a predeter- 
5 mined event occurs. When the link 821 is clicked, 

information connected to the link is displayed. 
Further, when the button 835 is clicked, reproduction 
of DVD contents 102 is started. 

In the embodiment, click of an object is 

10 called, in particular, a "V-click (video click) " to 

discriminate this click from a click of an icon, such 
as a reproduction button, or a click of a link on 
a browser screen. In FIG. 2, a V-click means a click 
of the object 811. An object is a two-dimensional area 

15 defined by the frames ranging from a certain start 

frame to a certain end frame (a moving picture 
contained in DVD contents 102 comprises frames) . The 
position or the attribute (such as the configuration) 
of this area can vary in units of frames. The object 

20 may be, for example, a particular person, vehicle, 

building, etc., or part of them, for example, the 
head of the person, the bonnet of the vehicle or the 
entrance of the building, in a moving picture. 

For example, the object 811 indicates, in the form 

25 of an approximate oval, the area of an object or a 

personal appearing in a moving picture. Such an object 
or a person in a moving picture will be referred to as 
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a "visual object". In FIG. 2, the image corresponding 
to a visual object is not shown. Further, although the 
object 811 is shown in FIG. 2, it may not always be 
displayed. Alternatively, a user may determine whether 
5 or not the object 811 should be displayed, or the 

display of the object 811 may be automatically 
controlled on the basis of certain information. 

A set of two-dimensional object areas from the 
start frame to the end frame is three-dimensional 
10 since the dimension of time is included in it. When a 

V-click is performed at a point included in the three- 
dimensional area, the object is hit by the V-click. 
This three-dimensional area can be described in various 
ways . 

15 The above-described system configuration can be. 

modified in various ways. For example, another window 
may be added to the video display window 81, the 
information display window 82 and the DVD operation 
window 83. 

2 0 The video reproduction apparatus 1 and remote 

controller 4 can be connected by radio using an 
infrared ray, Bluetooth (registered trademark) , or any 
other means. Further, the remote controller 4 may be 
dedicated to the video reproduction apparatus 1 or may 

25 be a versatile one. For example, the remote controller 

4 may be formed of a mobile telephone terminal 
utilizing Bluetooth. 
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Furthermore, the video reproduction apparatus 1 
may have a function or functions employed in general 
DVD players. Of course, the video reproduction 
apparatus 1 may be formed as one body with the monitor 
5 apparatus 8 such as a TV set (for example, a TV set 

with a video reproduction function can be used) . 

In addition, the remote controller 4 may have a 
conventional input unit such as a reproduction button 
835, as well as an input unit for operating the cursor 

10 812. Further, the video reproduction apparatus 1 

itself may have an input unit such as a reproduction 
button. A mouse or a keyboard may be connectable to 
the video reproduction apparatus 1 itself. The speaker 
84 may be built in the monitor apparatus 8. 

15 Now three types of V-click data items are 

described. 

FIG. 3 shows the structures of schedule data items 
SD1, SD2, 

The schedule data items SD1 and SD2 are used to 
20 preset the start position and end position of each of 

activated states of objects corresponding to certain 
moving picture contents. Here the activated state of 
the object indicates that the object is positioned in 
a period in which the V-click can be received. 
25 The schedule data items SD1, SD2, are shown as 

"Scheduledata" in FIG. 3. In general, a plurality of 
schedule data items SD1, SD2, are provided as shown 
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in the top portion of FIG. 3. These schedule data 
items are called, as a whole, a "schedule file". 
The schedule data items are of different versions. 
In accordance with the reproduction capability of 
5 the video reproduction apparatus 1, schedule data of 

a certain version is read. In other words, different 
schedule data items are read if video reproduction 
apparatuses 1 of different reproduction capabilities 
are used. Further, the video data reproduction 
10 apparatus 1 may be constructed so that it is provided 

with a version corresponding to its reproduction 
capability, and this version is automatically referred 
to, thereby reading schedule data corresponding 
thereto . 

15 As seen from the middle portion of FIG. 3, the. 

schedule data SD2 comprises three types of field - 
fields for the above-mentioned "version" (Ver) SD21, 
"size" (Size) SD22 and "schedule" data items (Schedule) 
SD23, — , SD24. 

20 The "size" SD22 contains a numerical value 

indicative of the data capacity of the schedule data 
SD2 . The "size" SD22 also contains a numerical value 
used as an offset value. This value is used for 
skipping to next schedule data having a version 

25 different from selected "version" SD21, if the selected 

version data is not suitable for the video reproduction 
apparatus 1. If a break point can be set between 
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subsequent "version" data items, the "size" can be 

dispensed with. 

The "schedule" data items SD23, SD24 are each 

used to preset the start position and end position of 
5 the activated state of a certain object. If schedule 

data manages a plurality of objects, it has a plurality 

of "schedule" data items SD23, — , SD24. 

As shown in the bottom portion of FIG. 3, the 

"schedule" SD23 has five types of field. Specifically, 
10 the "schedule" SD23 comprises "label" (Label) SD231, 

"size" (Size) SD232, "start timestamp" (Start) SD233, 

"end timestamp" (End) SD234 and "object filename" 

(Filename) SD235. 

The "label" SD231 contains a name corresponding to 
15 each object managed by the "schedule" SD23. This name 

is an object name or an object identifier. 

The "size" SD232 in the "schedule" SD23 contains 

a numerical value indicative of the data capacity of 

each object managed by the "schedule" SD23. The "size" 
20 SD232 also contains a numerical value used as an offset 

value. This value is used for skipping to next 

schedule data having a label different from the "label" 

SD231, if the selected "label" SD231 does not 

correspond to the object to be currently reproduced by 
25 the video reproduction apparatus 1. If a break point 

can be set between subsequent "label" data items, the 

"size" SD232 can be dispensed with. 



The "start timestamp" SD233 indicates the start 
time of the activated state of an object corresponding 
to the "label" SD231 contained in the "schedule" SD23 
related to the "start timestamp", or indicates the 
start position of the activated state of the object 
in moving picture contents to be reproduced. If the 
time interval of activated states between the portions 
of the moving picture contents marked by objects, 
chapters, etc. is known beforehand, the start positions 
of the activated state of the objects can be computed 
from the start times of the activated state of the 
obj ects . 

The "end timestamp" SD234 indicates the end time 
of the activated state of an object corresponding to 
the "label" SD231 contained in the "schedule" SD23 
related to the end timestamp data, or indicates the end 
position of the activated state of the object in moving 
picture contents to be reproduced. 

The "start timestamp" SD233 and "end timestamp" 
SD234 are predetermined in units of "label" data items 
SD231, i.e., objects, thereby determining periods of 
the activated states of objects in the moving picture 
contents . 

The "object filename" SD235 contains a filename 
with which the corresponding object is defined, i.e., 
the name of an object file containing object data. 
As described in the second embodiment, if object data 
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exists on the Internet, a URL may be used as its object 
filename . 

In the video reproduction apparatus 1, when the 
video data of to-be-reproduced DVD contents 102 is 
5 designated using, for example, its reproduction 

filename, schedule data corresponding to the video data 
is read. The schedule data defines the object data 
that should be read and activated, the reproduction 
scene of the video data in which the object data should 
10 be read and activated, and the reproduction scene in 

which the. activated state of the object data should be 
finished. 

Further, in the reverse link operation described 
later, there is also a case where the schedule data is 
15 read to refer to the definition of an object, the start 

time of the activated state of the object, etc., using 
an object name stored in the "label" SD231 of the 
object data. 

In the example of FIG. 3, the schedule file 
20 contains a plurality of schedule data items SD1, SD2, 

However, the schedule file may have only one 
schedule data item. Further, if only one version is 
given to schedule data, the "version" SD21 field is not 
needed. 

25 FIG. 4 is a view illustrating the schedule of the 

period of the activated state of each object in certain 
schedule data. 



- 29 - 



As described above with reference to FIG . 3, the 
schedule data defines the object data that should be 
read and activated, the reproduction scene of the video 
data in which the object data should be read and 
5 activated, and the reproduction scene in which the 

activated state of the object data should be finished. 
If each scene of reproduced video data is made to 
correspond to a time point, each object is also 
designated by the start time and end time assigned to 

10 each object. In DVD contents 102, since each scene of 

reproduced video data is made to correspond to a time 
point, each object appearing during the reproduction of 
moving picture contents can be designated by the start 
time and end time assigned to each object. In the case 

15 of FIG. 4, the period of the activated state of an 

object are designated by the start time (Start) and end 
time (End) . 

In FIG. 4, activated state of an object is 
started at a time point S]_ and ended at a time point 

20 E]_. Thus, the activated state of an object 0^ (k = 1, 

2, 3, 4, 5 or 6) is started at a time point and 
ended at a time point E^. The start and end of the 
activated state of an object can be performed 
independently of any other object. For example, 

25 the activated state of an object O3 can be started 

and ended during the reproduction of an object O2 . 
Further, objects, for example, O4, O5 and O5, can 
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be activated, with their activation time periods 
overlapping each other. 

Without using schedule data, object data can be 
activated at a desired time. In this case, the 
5 reproduction filename of DVD contents 102 is input, and 

all object data necessary for a to-be-reproduced file 
designated by the reproduction filename is set in 
a ready state. 

However, a lot of time is required to make, in 

10 a ready state, all object data necessary for the to-be- 

reproduced file, since the object data needs to be read 
and stored into the internal memory of the V-click 
engine 36. Further, if a great amount of object data 
needs to be made in a ready state, it is very possible 

15 that the capacity of the internal memory of the V-click 

engine 36 may become short, and at worst, the object 
data cannot be activated. 

As shown in FIG. 4, in schedule data, object data 
for activating a predetermined object is read in 

20 association with the start time of the activated state 

of the object data, which requires a low memory 
consumption and hence reduces the possibility of the 
internal memory capacity of the V-click engine 36 
becoming short. Moreover, schedule data can minimize 

25 the amount of object data unnecessary at a certain 

time, thereby enabling more effective use of the 
internal memory of the V-click engine 36. As a result, 



the time required for displaying an object can be 
minimized. 

The structure of mask data is described with 
reference to FIG. 5, on the basis of which a moving 
picture is subjected to mask processing. 

In FIG. 5, mask data items MD1, MD2, are shown 
as "Maskdata". In general, a plurality of mask data 
items MD1, MD2, are provided as shown in the top 
portion of FIG. 5. These mask data items are of 
different versions (Ver) . In accordance with the 
reproduction capability of the video reproduction 
apparatus 1, mask data of a certain version is read. 
In other words, different mask data items are read 
if video reproduction apparatuses 1 of different 
reproduction capabilities are used. Further, the video 
data reproduction apparatus 1 may be constructed so 
that it is provided with a version corresponding, to 
its reproduction capability, and this version is 
automatically referred to, thereby reading mask data 
corresponding thereto. 

As seen from the middle portion of FIG. 5, the 
mask data MD2 comprises three types of field - fields 
for the above-mentioned "version" (Ver) MD21, 
"size" (Size) MD22 and "mask" data items (Mask) MD23, 
MD24. 

The "size" MD22 contains a numerical value 
indicative of the data capacity of the "mask" MD23. 
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The "size" MD22 also contains a numerical value used as 
an offset value. This value is used for skipping to 
next mask data having a version different from selected 
"version" MD21, if the selected version data is not 
suitable for the video reproduction apparatus 1. If a 
break point can be set between subsequent "version" 
data items, the "size" can be dispensed with. 

The "mask" data items MD23, MD24 are each used 

to preset the color of a mask and the reproduction 
start position and end position of a mask pattern. 
If mask data manages a plurality of masks, it has 
a plurality of "mask" data items MD23, MD24. 

As shown in the bottom portion of FIG. 5, the 
"mask" MD23 has four types of field. Specifically, the 
"mask" MD23 comprises "start timestamp" MD231 (Start), 
"end timestamp" MD232 (End), "color" MD233 (Color) and 
"pattern" MD234 (Pattern) . 

The "start timestamp" MD231 indicates the 
reproduction start time of a "mask" MD23 designated by 
the "color" MD233 and the "pattern" MD234 contained in 
the "mask" MD23 related to the "start timestamp" MD231, 
or indicates the reproduction start position of the 
"mask" MD231 in the moving picture contents. 

The "end timestamp" MD232 indicates the 
reproduction end time of the "mask" MD23 designated by 
the "color" MD233 and the "pattern " MD234 contained in 
the "mask" MD23 related to the "end timestamp" MD232, 
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or indicates the reproduction start position of the 
"mask" MD23 in the moving picture contents. 

As described above with reference to FIG. 4, in 
the schedule data, there is a case where before the 
5 activated state of the object that started at the start 

time SD233 is finished, further the activated state of 
another object is started, as in the case of the 
objects O2 and O3 or the objects O4, O5 and O5. On the 
other hand, in the case of the "mask" MD23, no further 
10 reproduction is started before reproduction that 

started at the time indicated by the "start timestamp" 
MD231 is finished. In other words, during reproduction 
of a certain mask, no other mask is reproduced. 

The "color" MD233 designates the color of a mask. 
15 The "pattern" MD234 designates a mask pattern. Mask 

patterns are bit maps of, for example, 32 X 32. 

In the example of FIG. 5, a plurality of mask data 
items MD1, MD2 are shown. However, there is a case 

where only one mask data item is used. Further, where 
20 only one version is imparted to mask data, the 

"version" MD21 field is not needed. 

The structure of object data is described with 
reference to FIG. 6. 

In FIG. 6, object data items OD1, • *, OD2 are 
25 shown as "Objdata". In general, a plurality of object 

data items OD1, OD2 are provided as shown in the 

top portion of FIG. 6. These object data items are 
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called, as a whole, an "object file". The object data 
items are of different versions (Ver) . In accordance 
with the reproduction capability of the video 
reproduction apparatus 1, object data of a certain 
version is read. In other words, different object 
data items are read if video reproduction apparatuses 1 
of different reproduction capabilities are used. 
Further, the video data reproduction apparatus 1 may be 
constructed so that it is provided with a version 
corresponding to its reproduction capability, and this 
version is automatically referred to, thereby reading 
object data corresponding thereto. 

As seen from the middle portion of FIG. 6, the 
object data 0D2 comprises three types of field - fields 
for the above-mentioned "version" (Ver) 0D21, "size" 
(Size) OD22 and "object" data items (Object) OD23, 
OD24. The "size" OD22 contains a numerical value 
indicative of the data capacity of the object data 0D2 . 
The "size" OD22 also contains a numerical value used as 
an offset value. This value is used for skipping to 
next object data having a version different from 
selected "version" 0D21, if the selected version data 
is not suitable for the video reproduction apparatus 1. 
If a break point can be set between subsequent 
"version" data items, the "size" OD22 can be dispensed 
with. 

The "object" data items OD23, ■**, OD24 are each 
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used to determine the operation of an object when the 
object is activated. If object data 0D2 manages a 
plurality of objects, it has a plurality of "object" 
data items OD23, OD24 . 

As shown in the bottom portion of FIG. 6, the 
"object" OD23 has six types of field. Specifically, 
the "object" OD23 has fields for "label" (Label) OD231, 
"size" (Size) OD232, "script" (Script) OD233, "MPEG7 
field" (MP7) OD 234, "order" (Order) OD235 and 
"balloon" (Balloon) OD236. The "label" OD231 contains 
a name corresponding to an object managed by the 
"object" OD23. This name is, for example, an object 
name or object identifier. Referring to the "label" 
OD231 and the "object filename" SD235 of schedule data, 
the activated state of an object based on the schedule 
data is performed. 

The "size" OD232 contained in the "object" OD23 
contains a numerical value indicative of the data 
capability of the "object" OD23. The "size" OD232 also 
contains a numerical value used as an offset value. 
This value is used for skipping the "object" OD23 and 
seeking the next "object" or the "object" OD24 that is 
the last "object" of the object data, if the selected 
"label" OD231 does not the object corresponding to the 
video data contents to be reproduced by the video 
reproduction apparatus 1. If a break point can be set 
between subsequent object data items, the "size" OD232 
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can be dispensed with. 

The "script" OD233 contains a script to be 
executed by the video reproduction apparatus 1 if 
a user click event (V-click) hits an object. The 
5 "script" OD233 may contain a URL, to which jumping is 

performed when a user click event (V-click) hits the 
object. Further, the "script" OD233 may contain no 
information . 

The "MPEG7 field" OD234 contains data defining the 

10 area, configuration or a position of an object that 

moves in a moving picture, or data defining the time 
related to the movement of the object. The data 
contained in the "MPEG7 field" OD234 relates to 
a moving picture in a three-dimensional area as a 

15 space-time (two-dimensional plane + time) . Further, 

the "MPEG7 field" OD234 contains one or more reference 
time points, and an interpolation logic for performing 
interpolation on the configuration of the object 
between the reference time points on the basis of 

20 the reference time points and an object configuration. 

The "MPEG7 field" OD234 also contains a determination 
logic for determining whether or not the object has 
been clicked. 

There are many data formats for the "MPEG7 field" 

25 OD234. The "MPEG7 field" OD234 can use various methods 

to determine whether or not the object is hit by a 
V-click corresponding to each data format. For 
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example, the object-area-information describing or 
determining methods disclosed in Jpn. Pat. Appln. 
KOKAI Publications Nos. 2000-285253, 2001-118075 and 
2001-111996, etc. may be used. 

The "order" OD235 contains data used to determine, 
if the object data OD2 manages a plurality of objects, 
the order of overlapping of the objects when moving 
picture contents is reproduced. In the example of 
FIG. 6, the "order" OD235 contained in the "object" 
OD23 or OD24 contains data used to determine the order 
of overlapping of the objects described in the "object" 
OD2 3 or OD2 4. 

For example, the "order" OD235 contains a 
nonnegative integer. If the "order" OD235 correspond- 
ing to a certain object contains a higher value than 
that corresponding to another object, the former object 
is superposed upon the latter object. The numerical 
value contained in the "order" OD235 is determined when 
object data is created. Further, if a certain object 
is never be superposed on any other object, the "order" 
OD235 field itself may be omitted, assuming that 
the "order" OD235 corresponding to this certain object 
may be made to contain a default value of, for example, 
0, and the "order" OD235 can be dispensed with. 

The "order" OD2 3 5 may be needed when two or 
more objects are simultaneously hit by a click event 
(V-click) . More specifically, the "order" OD235 may be 
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needed when a plurality of objects exist at a single 
time point in a moving picture, and should be simulta- 
neously hit by one V-click. In this case, the 
following three operations, for example, can be 
performed: 

(1) The scripts corresponding to all the objects 
to be hit are executed; 

(2) Only the script corresponding to the object 
that is defined to be displayed on the uppermost layer 
is executed; and 

(3) Only the scripts are executed, which 
correspond to the objects that range from the object to 
be displayed on the uppermost layer to an object on 

a ' predetermined lower layer. 

In the operations (2) and (3), it is necessary to 
determine the order of some objects, therefore the 
scripts corresponding to these objects must be executed 
with reference to the above-mentioned "order" OD235. 
On the other hand, in the operation (1), it is not 
necessary to arrange the object in order, therefore the 
"order" OD235 is not needed. 

The "balloon" OD236 contains data concerning 
a "balloon" displayed near an object in reproduced 
video data contents. Specifically, the "balloon" OD236 
contains, for example, a balloon text, font name, font 
size, font color, the background color of characters in 
a balloon, and boundary color. The balloon function 
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is not indispensable. If this function is not needed, 
it is sufficient if no data is contained in the field 
of the "balloon" OD236. Of course, if the balloon 
function is not needed, the "balloon" OD236 field 
5 itself may be omitted. 

In the above, schedule data and object data are 
provided separately, and the object filename or URL of 
the object data is input in the object filename SD235 
field (Filename) of the schedule data. Instead of 
10 providing the "object filename" SD235, the schedule 

data and the object data may be formed integral as one 
body. 

FIG. 7A shows the movement with time of an object 
in frames. FIG. 7B shows the frames appearing in 

15 FIG. 7A. FIG. 7C shows the movement of the object 

appearing in FIG. 7A. 

In FIG. 7A, reference numeral 120 denotes a start 
frame that contains an object, reference numeral 122 
an end frame that contains the object, reference 

20 numeral 121 a mid frame that contains the object, . 

reference numeral 140 the area of the object in the 
start frame 120, reference numeral 142 the area of the 
object in the end frame 122, and reference numeral 141 
the area of the object in the mid frame 121. FIG. 7B 

25 shows the frames 120, 121 and 122 extracted from 

FIG. 7A. Further, FIG. 7C shows the objects 140, 141 
and 142 extracted from FIG. 7A. 



In the example of FIG. IK, the object moves from 
the start frame 120 to the end frame 122, i.e., from 
the left side of the video display window 81 to the 
right side. The object areas 140, 141 and 142 indicate 
the positions of the object assumed when the object is 
moving from the left side to the right side. 

When, for example, a user clicks (V-click) a point 
inside an object area included in one of the frames 
ranging from the start frame 120 to the end frame 122, 
the determination algorithm employed in the apparatus 
determines that the object is hit. In another case, 
when a user clicks (V-click) a point inside or near 
an object area, the determination algorithm determines 
that the object is hit. 

Here a reverse link operation is described. 
FIG. 8 is a view useful in explaining the reverse link 
operation. 

An object jump button 813 that can be clicked is 
displayed on the video display window 81. An instruc- 
tion for, for example, reproducing an object related to 
this button is written on the object jump button 813. 
Upon clicking the object jump button 813, for example, 
reproduction is designated on the basis of the filename 
of a to-be-reproduced file in DVD contents 102, the 
label of an object, and the reproduction start time 
and end time of the object. In the example of FIG. 8, 
a message "Reproduce Visual Object A" is placed on the 



41 



object jump button 813. Here the Visual Object is the 
object that is overlaid on the video data contents and 
is visualized in reproduction. There may be another 
example, in which the HTML text displayed on the 
5 information display window 82 of FIG. 2 contains, 

as tag data, the filename of a to-be-reproduced file 
in DVD contents 102, the label of an object, and the 
start and end times of the object, and the video data 
contents corresponding to the object is reproduced on 

10 the video display window 81 when a user clicks a tag 

corresponding, to the tag data. 

When a user moves the cursor 812 to the object 
jump button 813 and clicks the button, a moving picture 
in the video display window 81 is switched to a moving 

15 visual object A 143. The moving visual object A 143 is 

reproduced during the time ranging from the start frame 
120 to the end frame 122. 

The object jump button 813 is associated with 
object data corresponding to the visual object A 143. 

20 When the object jump button 813 is clicked by the 

cursor 812, the object data corresponding to the visual 
object A 143 is read to thereby reproduce the object. 
Specifically, as described above, object data that 
contains a to-be-reproduced object is retrieved from 

25 the object file on the basis of the label of the 

object. Further, object data may be retrieved using 
schedule data. 
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When, for example, a script of ENAV contents 101 
is executed, an object in a moving picture can be 
displayed, highlighted, in association with the reverse 
link operation, 

FIG. 9 is a flowchart useful in explaining the 
reverse link operation of the video reproduction 
apparatus 1 . 

When a user clicks the object jump button 813 
in the video display window 81 or the link 821 in 
the information display window 82, the user event 
controller 33 receives this click as a user event 
signal (step SI) . Since this user event, signal 
corresponds to the reverse link operation, the user 
event controller 33 outputs the user event signal to 
the handler 34 (step S2) . 

The handler 34, in turn, outputs the user event 
to the parser/interpreter 35 (step S3) . The parser/ 
interpreter 35 reads, from tag data contained in the 
user event signal, the label and start and end times 
of an object to be activated by the reverse link 
operation, and also the filename of a to-be-reproduced 
file in DVD contents 102 corresponding to the object 
(step S4) . 

The parser/interpreter 35 generates a reproduction 
instruction signal on the basis of the start and end 
times and filename of the to-be-reproduced file in the 
DVD contents 102 read at the step S4, and outputs the 
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signal to the handler 34 (step S5) . The reproduction 
instruction signal has information concerning the 
reproduction filename, and the start and end times of 
the to-be-reproduced DVD contents 102. An object 
5 generated at a later step by the AV renderer 31 is 

overlaid on the moving picture corresponding to the to- 
be-reproduced file in the DVD contents 102. Thus, the 
moving picture of the DVD contents 102 is synthesized 
with a moving picture as the object. 

10 The handler 34 supplies the playback engine 2 with 

the instruction signal for reproducing the DVD contents 
102 (step S6) . Upon receiving the reproduction 
instruction signal, the playback engine 2 causes the 
reproduction of the to-be-reproduced file in the DVD 

15 contents 102 to start at the designated start time 

(step S7) . 

The parser/interpreter 35 supplies the V-click 
engine 36 with the start time and the end time, 
the label of the to-be-reproduced object and the 

20 reproduction filename (filename of the to-be-reproduced 

file in the DVD contents 102) read at the step S4 
(step S8) . The V-click engine 36 reads schedule data 
corresponding to the reproduction filename of the DVD 
contents 102 from ENAV contents 101. Referring to the 

25 label and the schedule data of the object, the V-click 

engine 36 acquires object data corresponding to the 
object, thereby preparing execution of the object. 



The parser/interpreter 35 outputs the reproduction 
filename of the DVD contents 102 to the V-click decoder 
321 (step S9) . On the basis of the reproduction 
filename of the DVD contents 102, the V-click decoder 
321 executes, for example, mask processing and balloon 
processing. Mask processing is executed using mask 
data read from the ENAV contents 101, while balloon 
processing is executed using the object data. 

At the start time, the moving picture of the DVD 
contents 102 and the object are reproduced. At the end 
time, the execution of the object is stopped. 

FIG.. 10 is a flowchart useful in explaining the 
operation of the V-click engine 36 during the reverse 
link operation of the video reproduction apparatus 1. 

Firstly, all objects currently kept ready in the 
internal memory of the V-click engine 36 are cleared 
therefrom (step Sll) . 

A schedule file corresponding to the filename of 
to-be-reproduced DVD contents 102 supplied from the 
parser/interpreter 35 is read from ENAV contents 101 
(step S12) . 

Subsequently, present time data is acquired from 
the handler 34 (step S13) . In this embodiment, the 
playback engine 2 supplies a timestamp to the handler 
34 periodically or as the occasion demands. On the 
schedule file acquired at the step S12, it is 
determined whether or not there is object data to be 



45 



reproduced at the present time. The object file 
contained in all to-be-reproduced objects is read (step 
S14) . For example, if the schedule file specifies that 
the reproduction start and end times of the object with 
5 a label of "00005" are "001333" and "002000", an object 

file that contains the definition of the object is read 
from the ENAV contents 101 by the time of "001333". 

On the basis of the read object file, the objects 
are made in a ready state. That is, these objects are 

10 internally registered as objects to be subjected to 

a determination as to whether or not a click time and. 
the position corresponding to a click event (V-click) 
signal hits the objects. Since the V-click engine 36 
has already read the schedule data, it can immediately 

15 detect object data corresponding to each object that 

satisfies the above condition. 

Mask display is turned on (step S15) . If there is 
any non-mask object currently internally registered, 
the V-click engine 36 clears it (step S16) , and 

20 registers, as the label of a non-mask object, the label 

of each object supplied from the parser/interpreter 35 
(step S17) . For example, ENAV contents 101 contain the 
label of each non-mask object to be registered, and 
registration is performed on the basis of this label. 

25 The configuration of a non-mask object at the 

present time is computed, and the area of the non-mask 
object is generated on the basis of the computation 
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result (step S18) . The resultant non-mask area is 
output to the V-click decoder 321 (step S19) . If a 
plurality of non-mask areas exist, the logical sum of 
the non-mask areas is supplied as the non-mask area to 
5 the V-click decoder 321. 

It is determined whether or not balloon display is 
ON, and if balloon display is ON, the program proceeds 
to a step S21. If, on the other hand, balloon display 
is OFF, the program proceeds to a step S22 . At the 

10 step S21, the position of an object corresponding to a 

balloon to be displayed is detected, thereby computing . 
the position of the balloon, and supplying the V-click 
decoder 321 with the contents and the position of the 
balloon. After that, the program proceeds to the step 

15 S22. The balloon display position is predetermined. 

It is, for example, above and to the right of an 
object. The balloon display position may be made 
changeable, such as to be left or right of an object, 
or above left or right of the object, by appropriately 

20 changing the variable of the V-click engine 36. 

Further, there may be a case where the labels of a 
number of objects are pre-registered as balloon display 
targets in the V-click engine 36. Such registration is 
made referring to, for example, ENAV contents 101 that 

25 stores the labels of objects as to-be-registered 

balloon display targets. 

When a user clicks the screen of the video 
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reproduction apparatus 1, using a remote controller or 
mouse, a click event signal is generated. This click 
event signal is output to the V-click engine 36 
together with click position information. At the step 
5 S22, it is determined whether or not a click event 

signal is input. If no click event signal is input, 
the program returns to the step S13, whereas if a click 
event signal is input, the program proceeds to a step 
S23. At the step S23, a currently valid object is 

10 detected which contains the space-time point designated 

by the click position and the present time, i.e., which 
is hit by the above click. 

At the next step S24, it is determined whether or 
not there is an object hit by the click. If no such 

15 object exists, the program returns to the step S13, 

whereas if such an object exits, the program proceeds 
to a step S25. At the step S25, any object whose 
reproduction end time has been exceeded is cleared. 
The V-click engine 36 contains objects that are ready 

20 for a click, mask display and balloon display. The 

V-click engine 36 compares the reproduction end time of 
each object with the present time, thereby clearing, 
from the internal memory, any object whose reproduction 
end time has been exceeded. A script for any object 

25 contained in the non-cleared objects and hit by the 

click is sent to the parser/interpreter 35 (step S26) , 
followed by the program returning to the step S13. 
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As described above, an object in a moving picture 
is- called for by a user click on a browser screen, 
thereby realizing a reverse link operation. Further, 
this object is highlighted or displayed in a balloon as 
5 a result of mask exclusion. 

In the example of FIG. 10, only one object is hit 
by a click, therefore no determination concerning 
"priority" between a plurality of objects is performed 
using numerical values related to "order". If order 
10 data is used as previously mentioned, even a plurality 

of objects that are hit by one click can be processed. 

Referring to FIGS. 11 and 12, the relationship 
between a visual object 810 and object 811 in a moving 
picture contained in DVD convents 102 will be 
15 described. 

FIG. 11 is a view showing a video display window 
in which mask processing is executed. 

Assuming that the interior of the object 811 is 
a non-mask area 815, and the area in the video display 
20 window 81 other than the non-mask area 815 is a mask 

area 814, the mask area 814 is subjected to mask 
processing. As a result, the object 811 or visual 
object 810 can be highlighted. 

There are various types of mask processing. 
25 For example, a certain mask image is displayed instead 

of a contents image in a mask area, a mask image is 
superposed upon a contents image in a mask area, or the 



brightness or color saturation of a mask area is 
reduced. In the embodiment, a mask image is superposed 
upon a contents image in a mask area. 

FIG. 12 is a view illustrating a state in which 
the visual object 810 and the cursor 812 overlap. 

FIG. 12 shows an example of a relationship between 
the visual object 810 and object 811 in one of the 
frames ranging from the start frame 120 to the end 
frame 122 shown in FIG. 7A. In the example of FIG. 12, 
a rectangular visual object 810 exists in a screen 
image, and an elliptic object 811 slightly larger than 
the visual object 810 is defined so that a user can 
easily click the visual object. 

As described above, the object 811 may be 
displayed or may not be displayed. When the object 811 
is not displayed, the user can see only the visual 
object 810. In this case, when the user clicks 
the visual object 810, the object 811 is actually 
V-clicked. 

Of course, the shape of the objects is not limited 
to a rectangle or ellipsoid. 

FIG. 13 illustrates a state in which a balloon is 
protruded from the visual object 810. 

The balloon function is a variation of the 
function of displaying information related to the 
visual object 810. A balloon in which, for example, 
an explanation sentence related to the visual object 
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810 is written is displayed near the visual object 811 
on the video display window 81, as shown in FIG. 13. 

In the descriptions so far, in the reverse link 
operation, when a user clicks, for example, a browser 
5 screen button or link (when the user clicks, for 

example, the link 821 on the window 82 of FIG. 2), if 
this button or link is associated with an object in 
a moving picture, this object is referred to. 
Further, the reverse link operation enables an object 

10 in a moving picture to be displayed highlighted when, 

for example, a script in ENAV contents 101 is executed. 
In reverse link operation above described in detail, 
the object is treated the object displayed in the 
display. However the object may be treated the object 

15 non-displayed in the display. In case that non- 

displayed object is treated, the reproduction of the 
video data contents can be started from the start time 
of the video data contents corresponding to the object. 
FIG. 14 is a flowchart useful in explaining the 

20 operation of the video reproduction apparatus 1 

executed upon receiving a user event signal during its 
reproduction operation . 

The instruction (s ) contained in a user event 
signal received by the user event controller 33 will be 

25 described, taking a mask-display ON/OFF instruction, 

balloon-display ON/OFF instruction and click 
instruction as examples. 
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Firstly, a user clicks to generate a reproduction 
instruction event signal, and the user event controller 

33 receives this event signal (step S101) . The user 
event controller 33, in turn, outputs the reproduction 

5 instruction event signal to the playback engine 2 (step 

SI 02) . The playback engine 2 outputs the name of a to- 
be-reproduced file in DVD contents 102 to the handler 

34 (step S103) . 

The handler 34 outputs the reproduction filename 

10 of the DVD contents 102 to the parser/interpreter 35 

(step S104) . The parser/interpreter 35 supplies the 
V-click engine 36 with the reproduction filename and 
an instruction to read the reproduction filename (step 
S105) . Similarly, the parser/interpreter 35 supplies 

15 the V-click decoder 321 with the reproduction filename 

and an instruction to read the reproduction filename 
(step S106) . The V-click engine 36 reads the to-be- 
reproduced file in the DVD contents 102 corresponding 
to the reproduction filename, and waits for a mask- 

20 display ON/ OFF event, balloon-display ON/OFF event or 

click event that may occur later. The V-click decoder 
321 prepares for creating a mask or balloon on the 
basis of the non-mask area, the balloon contents and 
the balloon display position generated by the V-click 

25 engine 36. 

The playback engine 2 starts to reproduce the 
to-be-reproduced file in the DVD contents 102 
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corresponding to the reproduction filename (step S107) . 
At the step S107, during the reproduction of the file, 
the user event controller 33 assumes a waiting state 
for waiting for a user event signal. In this state, 
5 the user event controller 33 determines whether or not 

a user event signal has been generated (step S108). 
At the step S108, if the user event controller 33 
determines that a user event signal has been generated, 
it outputs the user event signal to the parser/ 

10 interpreter 35 via the handler 34 (step S109) . 

The parser/interpreter 35 determines whether or 
not the instruction contained in the user event signal 
is an instruction as to mask display ON/OFF (step 
S110) . If the instruction is determined to be a mask 

15 instruction as to the mask display ON/OFF, the 

program proceeds to a step Sill, whereas if the 
instruction is determined not to be the mask 
instruction as to the mask display ON/OFF, the program 
proceeds to a step S112. 

20 At the step Sill, the parser/interpreter 35 

supplies the V-click engine 36 with an instruction to 
turn on or off mask display, followed by the program 
proceeding to the step S112. The V-click engine 36 
stores an internal flag that indicates that mask 

25 display is in the ON or OFF state. At the step S112, 

the parser/interpreter 35 determines whether or not 
the instruction contained in the user event signal is 
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a balloon instruction as to balloon display ON/OFF. 
If the instruction contained in the user event signal 
is determined to be the balloon instruction as to the 
balloon display ON/OFF, the program proceeds to a step 
5 S113, whereas if the instruction contained in the user 

event signal is determined not to be the instruction as 
to the balloon display ON/OFF, the program proceeds to 
a step S114. 

At the step S113, the parser/interpreter 35 

10 supplies the V-click engine 36 with an instruction to 

turn on or off balloon display, followed by the program 
proceeding to the step S114. The V-click engine 36 
stores an internal flag that indicates that balloon 
display is in the ON or OFF state. At the step S114, 

15 the parser/interpreter 35 determines whether or not 

the instruction contained in the user event signal is 
an instruction to click an object. If the instruction 
is determined to be the instruction to click an object, 
the program proceeds to a step S115, whereas if the 

20 instruction is determined not to be the instruction to 

click an object, the program returns to the step S108. 

At the step S115, the parser/interpreter 35 
outputs a click event signal to the V-click engine 36, 
and also outputs, to the engine 36, the time of the 

25 occurrence of a click corresponding to the click event 

signal and the position of the click on the video 
display window 81. 
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The parser/interpreter 35 receives a script 
corresponding to the instruction contained in the click 
event signal from the V-click engine 36, thereby 
executing the script (step S116) . After the step S116, 
5 the program returns to the step S108, where the user 

event controller 33 monitors occurrence of another user 
event . 

FIG. 15 is a flowchart useful in explaining an 
operation of the V-click engine 36 shown in FIG. 1. 

10 This operation corresponds to the operation of the 

V-click engine 36 illustrated in FIG. 14. Further, 
the same operations as illustrated in FIG. 10 are 
denoted by corresponding reference numerals and their 
explanations are omitted. 

15 As described above referring to FIG. 10, the 

V-click engine 36 makes an object in a waiting state on 
the basis of the read object data, and then determines 
whether or not mask display is ON. As described at the 
step Sill of FIG. 14, the ON/OFF Of mask display is 

20 stored in the form of an internal flag of the V-click 

engine 36. If mask display is ON, the program proceeds 
to a step S20. The processing later is the same as 
that described with reference to FIG. 10. 

FIG. 16 is a flowchart useful in explaining the 

25 operation of the V-click decoder 321 shown in FIG. 1. 

The V-click decoder 321 receives, from the V-click 
engine 36, V-click engine data, such as a non-mask 
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area, the contents and the position of a balloon, etc., 
when necessary, and appropriately processes it. 

Upon receiving, from the parser/interpreter 35, 
an instruction to read a reproduction filename and 
5 a corresponding to-be-reproduced file, the V-click 

decoder 321 reads mask data corresponding to the 
reproduction filename (step S301) . Further, the 
V-click decoder 321 acquires present time data from the 
handler 34 (step S302) . 

10 Upon receiving the balloon contents and position 

from the V-click engine 36, the V-click decoder 321 
stores them in its internal memory. If the internal 
memory already stores the balloon contents and the 
position, they are replaced with the input ones. 

15 Specifically, the internal memory is checked and it is 

determined whether or not the balloon contents and the 
position are stored. If they are stored, the program 
proceeds to a step S304, whereas if they are not 
stored, the program proceeds to a step S305. ■ At the 

20 step S304, a balloon is generated from the stored 

. balloon contents and the position, and writing the 
balloon to a video memory. 

Upon receiving a non-mask area from the V-click 
engine 36, the V-click decoder 321 stores it in the 

25 internal memory. If the internal memory already stores 

a non-mask area, this area is replaced with the input 
one. Specifically, the internal memory is checked and 
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it is determined whether or not a non-mask area is 
stored. If a non-mask area is stored, the program 
proceeds to a step S306, whereas if it is not stored, 
the program returns to the step S302. At the step 
5 S306, a mask, in which the non-mask area is excluded, 

is generated and recorded to the video memory. 
Further, the V-click decoder 321 generates a mask using 
a pattern and color designated by mask data obtained at 
the present time. According to the embodiment, the 

10 video data reproduction apparatus can have an effective 

navigation function for video data contents. For 
example, the video data reproduction apparatus can do 
the reverse link operation and effectively display 
an object. 

15 (Second Embodiment) 

FIG. 17 shows a state in which the video 
reproduction apparatus 1 is connected to a server 7 via 
the Internet 6. 

In the second embodiment, the video reproduction 

20 apparatus 1 can communicate with a desired server 7 

via a network (hereinafter referred to as the 
"Internet") 6. 

A description will now be given, in particular, 
of the case where V-click data is supplied to ENAV 

25 contents 101. 

ENAV contents 101 can be supplied to certain DVD 
contents 102 in various ways: (1) As in the first 
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embodiment, ENAV contents 101 for corresponding 
particular DVD contents 102 is recorded to a DVD 100 
that contains the particular DVD contents 102. 
(2) ENAV contents 103 for particular DVD contents 102 
5 is provided via a certain server 7 on the Internet 6. 

The same ENAV contents as above can be also provided 
using a combination of the ways (1) and (2). 
Alternatively, a part of ENAV contents may be provided 
using the way (1), and another part of the ENAV 

10 contents be provided using the other way (2). The way 

(1) is advantageous in that no communication is needed. 
On the other hand, the way (2) is advantageous in that 
ENAV contents 101 can be easily updated (by, for 
example, fetching a newest object file via the Internet 

15 6 to update the operation of an object hit by a click), 

or in that new ENAV contents can be easily provided 
even to a DVD 100 with no ENAV contents 101 shipped in 
the past. 

In particular, if a user would like to use only 
20 V-click data recorded on a DVD 100, they do not have to 

load any unnecessary object into a memory in a video 
reproduction apparatus. This realizes low consumption 
of resources. 

Further, if a user would like to use V-click data 
25 downloaded via the Internet 6, they can add the V-click 

data later to the contents of a DVD 100 that contains 
no V-click data. Thus, they can provide a new function 
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utilizing a V-click to the DVD contents 102 that they 
already own. Moreover, even DVD contents 102 that 
already contains V-click data can be updated in its 
operation by acquiring new V-click data via the 
5 Internet. 

In the above-described way (1), since ENAV 
contents 101 is recorded on a DVD 100, the playback 
engine 2 can read the ENAV contents 101 from the 
DVD 100. 

10 In the way (2), since ENAV contents 103 is on the 

Internet 6, it is downloaded to the video reproduction 
apparatus 1 from a server 7 connected thereto and the 
Internet 6. Alternatively, ENAV contents 103 may be 
downloaded from a server to an apparatus (such as a PC, 

15 monitor apparatus 8, etc.) connected thereto via the 

Internet 6, and be transferred from this apparatus to 
the video reproduction apparatus 1. In this case, a 
URL indicative of the position of desired ENAV contents 
103 on the Internet 6 (or the URL of an information 

20 site that can provide information used to acquire the 

first-mentioned URL) may be prestored in the video 
reproduction apparatus 1. If this URL is recorded on 
a DVD 100, the playback engine 2 reads it from the 
DVD 100. 

25 Further, the video reproduction apparatus 1 may be 

connected to a home network that is connected to the 
Internet 6. 
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FIG, 18 is a block diagram illustrating a video 
reproduction apparatus 11 according to the embodiment. 

The video reproduction apparatus 11 of this 
embodiment differs from the video reproduction 
5 apparatus 1 of FIG. 1 in that the former further 

comprises a V-click storage 37. The V-click storage 37 
stores V-click data contained in ENAV contents 103 and 
downloaded via the Internet 6. Further, the storage 37 
can also store V-click data contained in ENAV contents 
10 101 recorded on a DVD 100. In this case, the video 

reproduction apparatus 11 can acquire a part of the 
ENAV contents 101 from the DVD 100, and another part of 
the ENAV contents 103 from the Internet 6. 

Furthermore, the video reproduction apparatus 11 
15 can be modified so that the V-click data of necessary 

ENAV contents 103 will be downloaded and used at once 
when needed, instead of employing the V-click storage 
37 and storing V-click data in it. 

It is a matter of course that any other data of 
2 0 ENAV contents may be downloaded and stored in a 

predetermined storage, or may be downloaded and used at 
once when needed, without storing it. 

Otherwise, the video reproduction apparatus 11 of 
the embodiment is different to the video reproduction 
25 apparatus 1 of the first embodiment shown in FIG. 1. 

In the video reproduction apparatus 11, the parser/ 
interpreter 35 executes processing for identifying the 
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server. When the video reproduction apparatus 11 is 
connected to the server designated by a URL contained 
in the script of ENAV contents 101, the parser/ 
interpreter 35 executes the processing for identifying 
5 the server. If the identification has succeeded, 

the parser/interpreter 35 supplies the server with 
a request for downloading V-click data. 

The parser/interpreter 35 prestores a public key 
P k corresponding to the secret key S k of the server. 

10 Assume that the parser/interpreter 35 has received 

V-click data and data A from a server. At this time, 
the parser/interpreter 35 firstly computes a hash value 
H from the V-click data. The same function as that of 
the server is used to compute the hash value. 

15 Subsequently, the parser/interpreter 35 decodes 

the data A, using P k , thereby obtaining data D (P k ) 
[A]. The parser/interpreter 35 decodes the data D (P k ) 
[A] , using a contents ID (CID) , thereby obtaining D 
(CID) [D (P k ) [A]]. The parser/interpreter 35 

20 determines whether or not the computed H is identical 

to D (CID) [D (P k ) [A]]. If they are identical, the 
parser/interpreter 35 determines that the V-click data 
received from the server is certified. If, on the 
other hand, they are not identical, the parser/ 

25 interpreter 35 determines that the V-click data 

received from the server is not certified. The 
processing for identifying is described later in detail 
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with reference to FIG . 22. 

A description will be given of the case shown in 
FIG. 18 where the V-click storage 37 is provided, and 
V-click data is downloaded from a server 7 on the 
Internet 6 that provides ENAV contents 103, and is 
stored for use in the V-click storage 37. 

In this case, the video reproduction apparatus 11 
can download V-click data from a server 7 on the 
Internet 6 that provides ENAV contents 103, and can 
store it in the V-click storage 37. 

The download of V-click data is started by, for 
example, a user's request, or by executing a script 
in ENAV contents 101 recorded on a DVD 100 or in 
downloaded ENAV contents 103. The downloaded V-click 
data is processed in the same manner as V-click data 
contained in ENAV contents 101 recorded on a DVD 100. 
To-be-downloaded V-click data is at least one of 
schedule data, mask data and object data. For example, 
the schedule data of ENAV contents recorded on a DVD 
100 may be used, while object data and mask data may be 
replaced with those obtained from a server 7 on the 
Internet 6 that provides ENAV contents 103 . 

Downloaded V-click data is stored in association 
with DVD contents 102. Since DVD contents 102 has 
an ID dedicated thereto, downloaded V-click data can be 
stored in association with DVD contents 102. Further, 
even if contents ID is not explicitly defined, a hash 
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value created on the basis of part or all of the 
contents can be used as an ID dedicated to the 
contents . 

FIG. 19 is a view illustrating a table in which 
V-click data items are associated with contents IDs. 

Downloaded V-click data items are stored in the 
V-click storage 37 in association with respective 
contents IDs, as shown in the table of FIG. 19. 

FIG. 20 is a view illustrating a table in which 
URLs indicating the locations of V-click data items are 
associated with contents IDs. 

In this example, downloaded V-click data is stored 
in the V-click storage 37 in the video reproduction 
apparatus 11. However, V-click data items themselves 
may be left in a server 7 on the Internet 6, and the 
video reproduction apparatus 11 may store URLs of those 
data items in association with contents IDs correspond- 
ing thereto. In this case, the V-click storage 37 
stores, for example, the table as shown in FIG. 20. 

In the table of FIG. 20, a file with an extension 
"elk" downloaded from the URL of each V-click data item 
is a library file. For example, this file is developed 
as schedule data, mask data and/or object data in 
association with a corresponding contents ID, as in 
the table of FIG. 19. 

Although V-click data in ENAV contents is 
described in the embodiment, the same can be said of 
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any other data- in ENAV contents. 

FIG. 21 is a flowchart useful in explaining the 
manner of downloading V-click data. Specifically, this 
flowchart illustrates a series of operations performed 
5 after a script in DVD contents 101 recorded on a DVD 

100 is executed upon a user request using, for example, 
a button click, until V-click data is downloaded from 
ENAV contents 103 stored in a server 7 designated by 
the script. 

10 FIG. 21 illustrates an example in which the video 

reproduction apparatus 11 is connected to the Internet 
6, and V-click data is downloaded from a server 7 on 
the Internet 6 that provides ENAV contents 103. 
A user request is input to the user event 
15 controller 33 (step S401) . The. user event controller 

33, in turn, outputs the user request to the handler 34 
(step S402) . The handler 34 outputs the user request 
to the parser/interpreter 35 (step S403) . 

The parser/interpreter 35 reads a script 
20 designated by the user request from ENAV contents 101 

recorded on the DVD 100, and executes it (step S404) . 
The video reproduction apparatus 11 is connected to 
a server 7 on the Internet 6 that provides ENAV 
contents 103 and is designated by a URL in the script. 
25 The parser/interpreter 35 certifies the server 7 

(step S405) . 

It is determined whether or not the 
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parser/interpreter 35 has succeeded in the certifica- 
tion of the server 7 (step 3406) . If the certification 
has succeeded, the program proceeds to a step S407, 
whereas if it has failed, this program is finished. At 
the step S407, the parser/interpreter 35 supplies the 
server 7 with a request for downloading V-click data. 

If the video reproduction apparatus 11 has 
received V-click data as a response to the request from 
the server 7, the parser/interpreter 35 certifies the 
V-click data {step S408) . 

It is determined whether or not the parser/ 
interpreter 35 has succeeded in the certification of 
the V-click data (step S409) . If the certification has 
succeeded, the program proceeds to a step S410, whereas 
if it has failed, this program is finished. At the 
step S410, the V-click data received by the video 
reproduction apparatus 11 is stored in the V-click 
storage 37 in association with a corresponding 
contents ID. 

If the server 7 is not certified at the step S406, 
or if the V-click data is not certified at the step 
S409, an error message may be displayed on the monitor 
apparatus 8 . 

FIG. 21 shows an example in which both the server 
7 that provides ENAV contents 103 and V-click data are 
certified. However, the certification of the server 7 
that provides ENAV contents 103 or that of V-click 
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data, or both may be omitted. Further, even if means 
for storing downloaded ENAV contents 103, such as the 
V-click storage 37, is not employed, either the server 
7 that provides ENAV contents 103 or V-click data, or 
both may be certified. 

The certification of V-click data is advantageous 
in the following point: 

If the specifications of V-click data are 
generally known and no certification is performed, 
anyone can theoretically create and use V-click data 
corresponding to arbitrary DVD contents 102. This may 
raise problems in copyrights or commercial morals. 
For example, any one can establish a link with a famous 
movie without permission of the owner of the copyright 
of the movie, thereby leading, to their E-commerce 
site, people who would like to see the movie. 
Therefore, it is desirable to certify V-click data in 
order to, for example, secure the quality of a link 
provided by V-click data and cater for users. The same 
can be said of the certification of a server 7 that 
provides ENAV contents 103. 

In the second embodiment, a server 7 that provides 
ENAV contents 103 stores a certain secret key S^, and 
the parser/interpreter 35 prestores a public key 
corresponding to the server secret key S^. The public 
key Pk is stored in, for example, a ROM area during 
manufacture . 
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A server 7 that provides ENAV contents 103 encodes 
the hash value H of V-click data using a contents ID 
(CID) . The resultant data is expressed as E (CID) [H] . 
After that, the server 7 further encodes E (CID) [H] 
5 using S k , thereby obtaining data A = E(S k ) [E (CID) [H] ] . 

The server 7 supplies the V-click data with data A to 
the video reproduction apparatus 11, data A being used 
as a signature. 

FIG. 22 is a flowchart illustrating certification 
10 of V-click data. 

The video reproduction apparatus 11 receives 
V-click data with data A from the server 7, and 
supplies them to the parser/interpreter 35 (step S501). 
The parser/interpreter 35 computes the hash value H of 
15 the V-click data (step S502). To compute the hash 

value, the same function as that used in the server 7 
that provides ENAV contents 103 is used. 

Subsequently, the parser/interpreter 35 decodes 
data A using P k , thereby obtaining certain data (step 
20 S503) . This data is expressed as .D ( P k ) [A] . After 

that, the parser/interpreter 35 decides D(P] C ) [A] using 
CID, thereby obtaining D(CID) [D(P k ) [A]] (step S504) . 

The parser/interpreter 35 determines whether 
or not the previously computed H is identical to 
25 D(CID) [D(P k ) [A]] (step S505) . If they are determined 

to be identical at the step S505, the program proceeds 
to a step S506, whereas if they are determined not to 
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be identical at the step S505, the program proceeds to 
a step S507. At the step S506, the V-click data has 
been certified. On the other hand, at the step S507, 
the V-click data has not been certified. At the step 
5 S505, function C (a, b) is used to determine whether 

or not the previously computed H is identical to 
D(CID) [D(P k ) [A]] . When using function C (a, b) , "a" 
is compared with "b", and "1" is output if they are 
identical, whereas "0" is output if they are not 
10 identical. In other words, the parser/interpreter 

35 determines whether function C (D(CID) [D(P k ) [A]],H) 
is 1 or 0. 

The above-described certification procedure means 
the following: 

15 . E(CID) [H] is data that depends upon V-click data 

and CID. The encoding operation using can be 
executed only by a server 7 that provides legitimate 
ENAV contents 103. Accordingly, E(S k ) [E(CID) [H] ] 
can be created only by the server 7 that provides 

20 legitimate ENAV contents 103. If the video 

reproduction apparatus 11 correctly decodes a signature 
E(S k ) [E(CID) [H] ] and confirms that the decoding result 
is identical to H, (i) the V-click data is certified 
to be legitimate, and (ii) it is confirmed that the 

25 V-click data corresponds to the contents with the 

contents ID (CID) . 

The certification method of the invention is 
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characterized in that a server 7 for providing ENAV 
contents 103 supplies the video reproduction apparatus 
11 with a value obtained using V-click data, contents 
ID and a predetermined algorithm, and in that the video 
5 reproduction apparatus 11 certifies the legitimacy of 

the V-click data by comparing the supplied value with 
a value obtained using V-click data, contents ID and 
the same predetermined algorithm as the above. 

The internal processing portion of the video 

10 reproduction apparatus of the embodiment can be 

realized only by hardware such as a semiconductor chip. 
Alternatively, a CPU is installed in the video 
reproduction apparatus and part or all of processing 
can be executed by software. 

15 Although the above-described embodiments employ 

a DVD 100, a video reproduction apparatus that uses 
another recording medium can be embodied. 

Further, the video reproduction apparatus can 
process video contents and corresponding ENAV contents 

20 103 that are obtained via a communication medium, such 

as the Internet. 

The flow charts of the embodiments illustrate 
methods and systems according to the embodiments of the 
invention. It will be understood that each block of 

25 the flowchart illustrations, and combinations of blocks 

in the flowchart illustrations, can be implemented by 
computer program instructions. These computer program 
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instructions may be loaded onto a computer or other 
programmable apparatus to produce a machine, such that 
the instructions which execute on the computer or other 
programmable apparatus create means for implementing 
the functions specified in the flowchart block or 
blocks. These computer program instructions may also 
be stored in a computer-readable memory that can direct 
a computer or other programmable apparatus to function 
in a particular manner, such that the instruction 
stored in the computer-readable memory produce an 
article of manufacture including instruction means 
which implement the function specified in the flowchart 
block of blocks. The computer program instructions may 
also be loaded onto a computer or other programmable 
apparatus to cause a series of operational steps to . 
be performed on the computer or other programmable 
apparatus to produce a computer programmable apparatus 
provide steps for implementing the functions specified 
in the flowchart block or blocks. 

Additional advantages and modifications will 
readily occur to those skilled in the art. Therefore, 
the invention in its broader aspects is not limited to 
the specific details and representative embodiments 
shown and described herein. Accordingly, various 
modifications may be made without departing from the 
spirit or scope of the general inventive concept as 
defined by the appended claims and their equivalents. 



